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MORPHOGENY PROTEIN SOLUBLE COMPLEX AND COMPOSITION THEREOF. 



Field of the Invention 

The present invention relates generally to 
5 morphogenic proteins and, more particularly, to 
compositions having improved solubility in aqueous 
solvents * 

Background of the Invention 

10 Morphogenic proteins ("morphogens" ) are well known 

and described in the art. See, for example, U.S. Pat. 
Nos. 4, 968,590; 5,011,691; 5,018,753; PCT US92/01968 and 
PCT US92/07432; as well as various articles published in 
the scientific literature, including Ozkaynak et al. 

15 {1992) J.Biol. Chem. 267:25220-25227 and Ozkaynak et .1. 
(1991) Biochem. Biophys. Res. Comm. 179 x116-123. The 
art has described how to isolate morphogenic proteins 
from bone, how to identify genes encoding these proteins 
and how to express them using recombinant DNA technology. 

20 The morphogenic proteins are capable of inducing 

endochondral bone formation and other tissue formation in 
a mammal when they are properly folded, dimerized and 
disulfide bonded to produce a dimeric species having the 
appropriate three dimensional conformation. The proteins 

25 have utility in therapeutic applications, either by 

direct or systemic administration. Where bone induction 
is desired, for example, the morphogen typically is 
provided to the desired site for bone formation in a 
mammal in association with a suitable matrix having the 

30 appropriate conformation to allow the infiltration, 

proliferation and differentiation of migrating progenitor 
cells. The morphogenic protein adsorbed to the surfaces 
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of a suitable matrix is generally referred to in the art 
as an osteogenic device. The proteins can be isolated 
from bone or, preferably, the gene encoding the protein 
is produced recombinantly in a suitable host cell* 

5 

The morphogen precursor polypeptide chains share a 
common structural motif, including a N- terminal signal 
sequence and pro region, both of which are cleaved to 
produce a mature sequence, capable of disulfide bonding 

10 and comprising an N-terminal extension and a C-terminal 
domain whose amino acid sequence is highly conserved 
among members of the family. In their mature dimeric 
forms, the morphogens typically are fairly insoluble 
under physiological conditions. Increasing the solubility 

15 of these proteins has significant medical utility as it 
would enhance systemic administration of morphogens as 
therapeutics. Various carrier proteins, including serum 
albumin and casein are known to increase the solubility 
of morphogens (see, for example, PCT US92/07432). PCT 

20 US92/05309 (WO 93/00050) discusses the use of various 
solubilizing agents, including various amino acids and 
methyl esters thereof, as well as guanidine, sodium 
chloride and heparin, to increase the solubility of 
mature dimeric BMP2. 

25 

Improved methods for the recombinant expression of 
morphogenic proteins is an ongoing effort in the art. It 
is an object of this invention to provide an improvement 
in the methods for producing and purifying morphogenic 
30 proteins having high specific activity, and for 
formulating compositions and osteogenic devices 
comprising these proteins. Another object is to provide 
soluble forms of morphogenic proteins consisting 
essentially of amino acid sequences derived from 
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morphogenic proteins. Another object is to provide 
formulations which stabilize the soluble complex of 
morphogenic proteins. Still another object is to provide 
means for distinguishing between soluble forms of the 
5 protein and the mature morphogenic species, to provide 
means for quantitating the amounts of these proteins in a 
fluid, including a body fluid, such as serum, 
cerebro-sprinal fluid or peritoneal fluid, and to provide 
polyclonal and monoclonal antibodies capable of 
10 distinguishing between these various species. 

Another object is to provide antibodies and 
biological diagnostic assays for monitoring the 
concentration of morphogens and endogenous anti-morphogen 

15 antibodies present in a body fluid and to provide kits 
and assays for detecting fluctuations in the 
concentrations of these proteins in a body fluid. U.S. 
Patent No. 4,857,456 and Urist et al. (1984) Froc. Soc. 
Exp. Biol. Med. 176 :472-475 describe a serum assay for 

20 detecting a protein purported to be a bone morphogenetic 
protein. The protein is not a member of the morphogen 
family of proteins described herein, differing in 
molecular weight, structural characteristics and 
solubility from these proteins. 

25 

Summary of the Invention 

It now has been discovered that morphogenic protein 
secreted into cultured medium from mammalian cells 
contains as a significant fraction of the secreted 
30 protein a soluble form of the protein, and that this 
soluble form comprises the mature dimeric species, 
including truncated forms thereof, noncovalently 
associated with at least one, and preferably two pro 
domains. It further has been discovered that antibodies 
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can be used to discriminate between these two forms of 
the protein. These antibodies may be used as part of a 
purification scheme to selectively isolate the mature or 
the soluble form of morphogenic protein, as well as to 
5 guantitate the amount of mature and soluble forms 

produced. These antibodies also may be used as part of 
diagnostic treatments to monitor the concentration of 
morphogenic proteins in solution in a body and to detect 
fluctuations in the concentration of the proteins in 
10 their various forms. The antibodies and proteins also 
may be used in diagnostic assays to detect and monitor 
concentrations of endogenous anti-morphogen antibodies to 
the various forms of these proteins in the body. 

15 An important embodiment of the invention-is a dimeric 

protein comprising a pair of polypeptide subunits 
associated to define a dimeric structure having 
morphogenic activity. As defined herein and in parent, 
related applications, morphogens generally are capable 

20 of all of the following biological functions in a 

morphogenically permissive environment: stimulating 
proliferation of progenitor cells; stimulating the 
differentiation of progenitor cells; stimulating the 
proliferation of differentiated cells; and supporting the 

25 growth and maintenance of differentiated cells. 

Each of the subunits of the dimeric morphogenic 
protein comprises at least the 100 amino acid peptide 
sequence having the pattern of seven or more cysteine 
30 residues characteristic of the morphogen family. 

Preferably, at least one of the subunits comprises the 
mature form of a subunit of a member of the morphogen 
family, or an allelic, species, chimeric or other 
sequence variant thereof, noncoval ntly compl xed with a 
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peptide comprising part or all of a pro region of a 
member of the morphogen family, or an allelic, species, 
chimeric or other sequence variant thereof. The pair of 
subunits and one or, preferably, two pro region peptides, 
5 together form a complex which is more soluble in aqueous 
solvents than the uncomplexed pair of subunits. 

Preferably, both subunits comprise a mature form of a 
subunit of a member of the morphogen family or an 

10 allelic, species, chimeric or other sequence variant 
thereof, and both subunits are noncovalently complexed 
with a peptide comprising a pro region, or a fragment 
thereof. Most preferably, each subunit is the mature 
form of human OP-1, or a species, allelic or other 

15 sequence variant thereof, and the pro region peptide is 
the entire or partial sequence of the pro region of human 
OP-1, or a species, allelic, chimeric or other sequence 
variant thereof. Currently, preferred pro regions are 
full length forms of the pro region. Pro region 

20 fragments preferably include the first 18 amino acids of 
the pro sequence. Other useful pro region fragments are 
truncated sequences of the intact pro region sequence, 
the truncation occurring at the proteolytic cleavage site 
Arg-Xaa-Xaa-Arg. As will be appreciated by those having 

25 ordinary skill in the art, useful sequences encoding the 
pro region may be obtained from genetic sequences 
encoding known morphogens. Alternatively, chimeric pro 
regions can be constructed from the sequences of one or 
more known morphogens. Still another option is to create 

30 a synthetic sequence variant of one or more known pro 
region sequences. 
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As used herein, the mature form of a morphogen 
protein subunit includes the intact C-terminal domain and 
intact or truncated forms of the N- terminal extensions. 
For example, useful mature forms of OP-1 include dimeric 
5 species defined by residues 293-431 of Seg ID No. 1, as 
well as truncated sequences thereof, including sequences 
defined by residues 300-431, 313-431, 315-431, 316-431 
and 318-431. Note that this last sequence retains only 
about the last 10 residues of the N-terminal extension 

10 sequence. Fig. 2 presents the N-terminal extensions for 
a number of preferred morphogen sequences. Canonical 
Arg-Xaa-Xaa-Arg cleavage sites where truncation may occur 
are boxed or underlined in the figure. As will be 
appreciated by those having ordinary skill in the art, 

15 mature dimeric species may include subunit combinations 
having different N-terminal truncations. 

Other soluble forms of morphogens include dimers of 
the uncleaved pro forms of these proteins (see below), as 
20 well as "hemi-dimers" wherein one subunit of the dimer is 
an uncleaved pro form of the protein, and the other 
subunit comprises the mature form of the protein, 
including truncated forms thereof, preferably 
noncovalently associated with a cleaved pro domain. 

25 

The soluble proteins of this invention also are 
useful in the formation of therapeutic compositions for 
administration to a mammal, particularly a human, and for 
the development of biological assays for monitoring the 
30 concentration of these proteins and endogenous antibodies 
to these proteins in cell samples and body fluids, 
including, but not limited to, serum, cerebrospinal fluid 
and peritoneal fluid. 
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The foregoing and other objects, features and 
advantages of the present invention will be made more 
apparent from the following detailed description of the 
invention. 

5 

Brief Description of the Drawings 

Fig. 1 is a schematic representation of a morphogen 
polypeptide chain as expressed from a nucleic acid 

10 encoding the sequence, wherein the cross-hatched region 
represents the signal sequence; the stippled region 
represents the pro domain; the hatched region represents 
the N- terminus ( "N- terminal extension" ) of the mature 
protein sequence; and the open region represents the 

15 C-terminal region of the mature protein sequence defining 
the conserved seven cysteine domain, the conserved 
cysteines being indicated by vertical hatched lines; 

Fig. 2 lists the sequences of the N-terminal 
20 extensions of the mature forms of various morphogens; and 

Fig. 3 is a gel filtration column elution profile of 
a soluble morphogen (OP-1) produced and purified from a 
mammalian cell culture by IMAC, S-Sepharose and S-200HR 
25 chromatography in TBS (Tris-buf fered saline), wherein V Q 
is the void volume, ADH is alcohol dehydrogenase (MW 150 
kDa), BSA is bovine serum albumin (MW 67 kDa), CA is 
carbonic anhydrase (MW 2 9 kDa) and CytC is cytochrome C 
(MW 12.5 kDa). 

30 
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Detailed Description 

A soluble form of morphogenic proteins now has been 
discovered wherein the proteins consist essentially of 
5 the amino acid sequence of the protein. The soluble form 
is a non-covalently associated complex comprising the pro 
domain or a fragment thereof, noncovalently associated or 
complezed with a dimeric protein species having 
morphogenic activity, each polypeptide of the dimer 

10 having less than 200 amino acids and comprising at least 
the C-terminal six, and preferably seven cysteine 
skeleton defined by residues 330-431 and 335-431, 
respectively, of Seq. ID No. 1. Preferably, the 
polypeptide chains of the dimeric species comprise the 

15 mature forms of these sequences, or truncated forms 

thereof. Preferred truncated forms comprise the intact 
C-terminal domain and at least 10 amino acids of the N- 
terminal extension sequence. The soluble forms of these 
morphogenic proteins may be isolated from cultured cell 

20 medium, a mammalian body fluid, or may be formulated in 
vitro . 

In vivo , under physiological conditions, the pro 
domain may serve to enhance the transportability of the 

25 proteins, and/or to protect the proteins from proteases 
and scavenger molecules, including antibodies. The pro 
domains also may aid in targeting the proteins to a 
particular tissue and/or to present the morphogen to a 
morphogen cell surface receptor by interaction with a 

30 co-receptor molecule. The isolated proteins may be used 
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in therapeutic formulations/ particularly for oral or 
parenteral administration, and in the development of 
diagnostic and other tissue evaluating kits and assays to 
monitor the level of endogenous morphogens and endogenous 
5 anti-raorphogen antibodies. 

Detailed descriptions of the utility of these 
morphogens in therapies to regenerate lost or damaged 
tissues and/or to inhibit the tissue destructive 

10 effects of tissue disorders or diseases, are provided 
in international applications US92/01968 (W092/15323) ; 
US92/07358 (W093/04692) and US92/07432 (WO93/05751) the 
disclosures of which are incorporated herein by 
reference. Morphogens, including the soluble morphogen 

15 complexes of this invention, are envisioned to have 
particular utility as part of therapies for 
regenerating lost or damaged bone, dentin, periodontal, 
liver, cardiac, lung and nerve tissue, as well as for 
protecting these tissues from the tissue destructive 

20 effects associated with an immunological response. The 
proteins also are anticipated to provide a tissue 
protective effect in the treatment of metabolic bone 
disorders, such as osteoporosis, osteomalacia and 
osteosarcoma; in the treatment of liver disorders, 

25 including cirrhosis, hepatitis, alcohol liver disease 
and hepatic encephalopathy; and in the treatment or 
prevention of ischemia reper fusion-associated tissue 
damage, particularly to nerve or cardiac tissue. 



30 
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Presented below are detailed descriptions of useful 
soluble morphogen complexes of this invention, as well as 
how to make and use them. 

5 i. Useful Soluble Morphogen Complexes - 
Protein Considerations 

Among the morphogens useful in this invention are 
proteins originally identified as osteogenic proteins, 

10 such as the OP-1, OP-2 and CBMP2 proteins, as well as 
amino acid sequence-related proteins such as DPP (from 
Drosophila), Vgl ( f rom Xenopus ) , Vgr-1 (from mouse, see 
U.S. 5,011,691 to Oppermann et al.)/ GDF-1 (from mouse, 
see Lee (1991) PNAS 88 :4250-4254), 60A protein (from 

15 Drosophila, Seq. ID No. 24, see Wharton et al. (1991) 
PNAS 88:9214-9218), and the recently identified OP-3. 

The members of this family, which are a subclass of 
the TGF-p super-family of proteins, share characteristic 

20 structural features, represented schematically in Fig. 1, 
as well as substantial amino acid sequence homology in 
their C- terminal domains, including a conserved seven 
cysteine structure. As illustrated in the figure, the 
proteins are translated as a precursor polypeptide 

25 sequence 10, having an N-terminal signal peptide sequence 
12, (the "pre pro" region, indicated in the figure by 
cross-hatching), typically less than about 30 residues, 
followed by a "pro" region 14, indicated in the figure by 
stippling, and which is cleaved to yield the mature 

30 sequence 16. The mature sequence comprises both the 
conserved C-terminal seven cysteine domain 20, and an 
N-terminal sequence 18, referred to herein as an 
^-terminal xtension, and which varies significantly in 
sequence between the various morphogens. Cysteines are 
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represented in the figure by vertical hatched lines 22. 
The polypeptide chains dimerize and these dimers 
typically are stabilized by at least one interchain 
disulfide bond linking the two polypeptide chain 
5 subunits . 

The signal peptide is cleaved rapidly upon 
translation, at a cleavage site that can be predicted in 
a given sequence using the method of Von Beijne ((1986) 

10 Nucleic Acids Research 14 ;4683-4691,) The "pro" form of 
the protein subunit, 24, in Fig. 1, includes both the pro 
domain and the mature domain, peptide bonded together. 
Typically, this pro form is cleaved while the protein is 
still within the cell, and the pro domain remains 

15 noncovalently associated with the mature form of the 

subunit to form a soluble species that appears to be the 
primary form secreted from cultured mammalian cells. 
Typically, previous purification techniques utilized 
denaturing conditions that disassociated the complex. 

20 

Other soluble forms of morphogens secreted from 
mammalian cells include dimers of the pro forms of these 
proteins, wherein the pro region is not cleaved from the 
mature domain, and "hemi-dimers " , wherein one subunit 
25 comprises a pro form of the polypeptide chain subunit and 
the other subunit comprises the cleaved mature form of 
the polypeptide chain subunit (including truncated forms 
thereof), preferably noncovalently associated with a 
cleaved pro domain. 

30 

The isolated pro domain typically has a substantial 
hydrophobic character, as determined both by analysis of 
the sequence and by characterization of its properties in 
solution. The isolated pro regions alone typically are 
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not significantly soluble in aqueous solutions, and 
require the presence of denaturants, e.g., detergents, 
urea, guanidine HC1, and the like, and/or one or more 
carrier proteins. Accordingly, without being limited to 
5 any given theory, the non-covalent association of the 
cleaved pro region with the mature morphogen dimeric 
species likely involves interaction of a hydrophobic 
portion of the pro region with a corresponding 
hydrophobic region on the dimeric species, the 
10 interaction of which effectively protects or "hides" an 
otherwise exposed hydrophobic region of the mature dimer 
from exposure to aqueous environments, enhancing the 
affinity of the mature dimer species for aqueous 
solutions • 

15 

Morphogens comprise a subfamily of proteins within 
the TGF-p superfamily of structurally related proteins. 
Like the morphogens described herein, TGF-p also has a 
pro region which associates non-covalently with the 

20 mature TGF-p protein form. However, unlike the 
morphogens, the TGF-0 pro region contains numerous 
cysteines and forms disulfide bonds with a specific 
binding protein. The TGF-pi pro domain also is 
phosphorylated at one or more mannose residues, while the 

25 morphogen pro regions typically are not. 

Useful pro domains include the full length pro 
regions described below, as well as various truncated 
forms hereof, particularly truncated forms cleaved at 
30 proteolytic Arg-Xaa-Xaa-Arg cleavage sites. For example, 
in OF-1, possible pro sequences include sequences defined 
by residues 30-292 (full length form); 48-292; and 
158-292. Soluble OP-1 complex stability is enhanced when 
the pro region comprises the full length form rather than 
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10 



a truncated form, such as the 48-292 truncated form, in 
that residues 30-47 show sequence homology to the 
N-terminal portions of other morphogens, and are believed 
to have particular utility in enhancing complex stability 
for all morphogens. Accordingly, currently preferred pro 
sequences are those encoding the full length form of the 
pro region for a given morphogen (see below). Other pro 
sequences contemplated to have utility include 
biosynthetic pro sequences, particularly those that 
incorporate a sequence derived from the N-terminal 
portion of one or more morphogen pro sequences. 



Table I, below, describes the various preferred 
morphogens identified to date, including their 

15 nomenclature as used herein, the sequences defining the 
various regions of the subunit sequences, their Seq. ID 
references, and publication sources for their nucleic 
acid and amino acid sequences. The disclosure of these 
publications is incorporated herein by reference. The 

20 mature protein sequences defined are the longest 
anticipated forms of these sequences. As described 
above, shorter, truncated forms of these sequences also 
are contemplated. Preferably, truncated mature sequences 
include at least 10 amino acids of the N-terminal 

25 extension. Fig. 2 lists the N-terminal extensions for a 
number of the preferred morphogen sequences described 
below. Arg-Xaa-Xaa-Arg cleavage sites that may yield 
truncated sequences of the mature subunit form are boxed 
or underlined in the figure. 



30 
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TABLE I 



"OP-l" Refers generically to the group of 

morphogenically active proteins expressed 
5 from part or all of a DNA sequence 

encoding OP-1 protein, including allelic 
and species variants thereof , e.g., human 
OP-1 ("hOP-l"), or mouse OP-1 ( M mOP-l".) 
The cDNA sequences and the amino acids 

10 encoding the full length proteins are 

provided in Seq. Id Nos. 1 and 2 (hOPl) 
and Seq. ID Nos. 3 and 4 (mOPl.) The 
mature proteins are defined by residues 
293-431 (hOPl) and 292-430 (mOPl), wherein 

15 the conserved seven cysteine skeleton is 

defined by residues 330-431 and 329-430, 
respectively, and the N- terminal 
extensions are defined by residues 293-329 
and 292-329, respectively. The "pro" 

20 regions of the proteins, cleaved to yield 

the mature, morphogenically active 
proteins, are defined essentially by 
residues 30-292 (hOPl) and residues 30-291 
(mOPl). 

25 

"OP-2" refers generically to the group of active 

proteins expressed from part or all of a 
DNA sequence encoding OP-2 protein, 
including allelic and species variants 
30 thereof, e.g., human OP-2 ("hOP-2" ) or 

mouse OP-2 ("mOP-2".) The full length 
proteins are provided in Seq. ID Nos. 5 
and 6 (hOP2) and Seq. ID Nos. 7 and 8 
(mOP2.) The mature proteins are defined 
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essentially by residues 264-402 (hOP2) and 
261-399 (m0P2), wherein the conserved 
seven cysteine skeleton is defined by 
residues 301-402 and 298-399, 
respectively, and the N- terminal 
extensions are defined by residues 264-300 
and 261-297, respectively. The "pro" 
regions of the proteins, cleaved to yield 
the mature, morphogenically active 
proteins likely are defined essentially by 
residues 18-263 (hOP2) and residues 18-260 
(mOP2). (Another cleavage site also 
occurs 21 residues upstream for both OP-2 
proteins . ) 



"OP-3" refers generically to the group of active 

proteins expressed from part or all of a 
DNA sequence encoding OP-3 protein, 
including allelic and species variants 

20 thereof, e.g., mouse OP-3 ("mOP-S".) The 

full length protein is provided in Seq. ID 
No. 9. The mature protein is defined 
essentially by residues 261-399 or 
264-399, wherein the conserved seven 

25 cysteine skeleton is defined by residues 

298-399 and the N-terminal extension is 
defined by residues 264-297 or 261-297. 
The "pro" region of the protein, cleaved 
to yield the mature, morphogenically 

30 active proteins likely is defined 

essentially by residues 20-262. 
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M BMP2/BMP4" refers to protein sequences encoded by the 
human BMP 2 and BMP 4 genes. The amino acid 
sequence for the full length proteins, 
referred to in the literature as BMP2A and 
5 BMP2B, or BMP 2 and BMP4, appear in Seq. ID 

Nos. 10 and 11, respectively, and in 
Wozney, et al. (1988) Science 242: 1528- 
1534. The pro domain for BMP 2 (BMP2A) 
likely includes residues 25-248 or 25-282; 

10 the mature protein, residues 249-396 or 

283-396, of which residues 249-296/283-296 
define the N-terminal extension and 295- 
396 define the C- terminal domain. The pro 
domain for BMP 4 (BMP2B) likely includes 

15 residues 25-256 or 25-292; the -mature 

protein, residues 257-408 or 293-408, of 
which 257-307/293-307 define the N- 
terminal extension, and 308-408 define the 
C-terminal domain. 

20 

M DPP" refers to protein sequences encoded by the 

Drosophila DPP gene. The amino acid 
sequence for the full length protein, 
including the mature form and the pro 

25 region, appears in Seq. ID No. 12 and in 

Padgett, et al (1987) Nature 325 : 81-84. 
The pro domain likely extends from the 
signal peptide cleavage site to residue 
456; the mature protein likely is defined 

30 by residues 457-588, where residues 457- 

586 define the N-terminal extension and 
487-588 define the C-terminal domain. 
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refers to protein sequences encoded by the 
Xenopus Vgl gene. The amino acid sequence 
for the full length protein, including the 
mature form and the pro region, appears in 
Seq.ID No. 13 and in Weeks (1987) Cell 51: 
861-867. The pro domain likely extends 
from the signal peptide cleavage site to 
residue 246; the mature protein likely is 
defined by residues 247-360, where 
residues 247-258 define the N-terminal 
extension, and residues 259-360 define the 
C- terminal domain. 

refers to protein sequences encoded by the 
murine Vgr-1 gene. The amino acid 
sequence for the full length protein, 
including the mature form and the pro 
region, appears in Seq. ID No. 14 and in 
Lyons, et al, (1989) PNAS 86: 4554-4558. 
The pro domain likely extends from the 
signal peptide cleavage site to residue 
299; the mature protein likely is defined 
by residues 300-438, where residues 
300-336 define the N-terminal extension 
and residues 337-438 define the 
C- terminus . 

refers to protein sequences encoded by the 
human GDF-1 gene. The cDNA and encoded 
amino sequence for the full length protein 
is provided in Seq. ID. No. 15 and Lee 
(1991) PNAS 88 :4250-4254. The pro domain 
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likely extends from the signal peptide 
cleavage site to residue 214; the mature 
protein likely is defined by residues 215- 
372, where residues 215-256 define the N- 
terminal extension and residues 257-372 
define the C-terminus. 

refers to protein sequences encoded by the 
Drosophila 60A gene. The amino acid 
sequence for the full length protein 
appears in Seq. ID No. 16 and in Wharton 
et al. (1991) PNAS 88:9214-9218) The pro 
domain likely extends from the signal 
peptide cleavage site to residue 324; the 
mature protein likely is defined by 
residues 325-455, wherein residues 325-353 
define the N-terminal extension and 
residues 354-455 define the C-terminus. 

refers to protein sequences encoded by the 
human BMP3 gene. The amino acid sequence 
for the full length protein, including the 
mature form and the pro region, appears in 
Seq. ID No. 17 and in Wozney et al. (1988) 
Science 242 ; 1528-1534. The pro domain 
likely extends from the signal peptide 
cleavage site to residue 290; the mature 
protein likely is defined by residues 291- 
472, wherein residues 291-370 define the 
N-terminal extension and residues 371-472 
define the C-terminus. 
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"BMP 5" refers to protein sequences encoded by the 

human BMP 5 gene. The amino acid sequence 
for the full length protein, including the 
mature form and the pro region, appears in 
5 Seq.ID No. IB and in Celeste, et al. 

(1990) PNAS 87 : 9843-9847. The pro domain 
likely extends from the signal peptide 
cleavage site to residue 316; the mature 
protein likely is defined by residues 
10 317-454, where residues 317-352 define the 

N- terminus and residues 352-454 define the 
C-terminus • 

"BMP6" refers to protein sequences encoded by the 

15 human BMP 6 gene. The amino acid sequence 

for the full length protein, including the 
mature form and the pro region, appears in 
Seq. ID No. 16 and in Celeste, et al. 
(1990) PNAS 87: 9843-5847. The pro domain 
20 likely includes extends from the signal 

peptide cleavage site to residue 374; the 
mature sequence likely includes 
residues 375-513, where residues 375-411 
define the N-terminus and residues 412-513 
25 define the C-terminus. 

Note that the OP-2 and OP-3 proteins have an 
additional cysteine residue in the C-terminal region 
(e.g., see residue 338 in these sequences), in addition 
30 to the conserved cysteine skeleton in common with the 
other proteins in this family. The GDF-1 protein has a 
four amino acid insert within the conserved skeleton 
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("Gly-Gly-Pro-Pro") but this insert likely does not 
interfere with the relationship of the cysteines in the 
folded structure. In addition, the CBMP2 proteins are 
missing one amino acid residue within the cysteine 
5 skeleton. 

The dimeric morphogen species are inactive when 
reduced, but are active as oxidized homodimers and when 
oxidized in combination with other morphogens of this 

10 invention. Thus, as defined herein, a morphogen useful 
in a soluble morphogen complex is a dimeric protein 
comprising a pair of polypeptide chains, wherein each 
polypeptide chain has less than 200 amino acids and 
comprises at least the C-termirial six, preferably seven 

15 cysteine skeleton defined by residues 335-431 of Seq. 
ID No. 1, including functionally equivalent 
arrangements of these cysteines (e.g., amino acid 
insertions or deletions which alter the linear 
arrangement of the cysteines in the sequence but not 

20 their relationship in the folded structure), such that, 
when the polypeptide chains are folded, the dimeric 
protein species comprising the pair of polypeptide 
chains has the appropriate three-dimensional structure, 
including the appropriate intra- or inter-chain 

25 disulfide bonds such that the protein is capable of 
acting as a morphogen as defined herein. The 
solubility of these structures is improved when the 
mature dimeric form of a morphogen, in accordance with 
the invention, is complexed with at least one, and 

30 preferably two, pro domains. 
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Various generic sequences (Generic Sequence 1-6) 
defining preferred C-terminal sequences useful in the 
soluble morphogens of this invention are described in 
USSN 07/923, 780 , incorporated herein above by 
5 reference. Two currently preferred generic sequences 
are described below. 

Generic Sequence 7 (Seq. ID No. 20) and Generic 
Sequence 8 (Seq. ID No. 21) disclosed below, 

10 accommodate the homologies shared among preferred 
morphogen protein family members identified to date, 
including OP-1, OP-2, OP-3, CBMP2A, CBMP2B, BMP3, 60A, 
DPP, Vgl, BMP5, BMP6, Vrg-1, and GDF-1. The amino acid 
sequences for these proteins are described herein (see 

15 Sequence Listing) and/or in the art, as well as in PCT 
publication US 92/07358, (WO93/04692) , for example. 
The generic sequences include both the amino acid 
identity shared by these sequences in the C-terminal 
domain, defined by the six and seven cysteine skeletons 

20 (Generic Sequences 7 and 8, respectively), as well as 
alternative residues for the variable positions within 
the sequence. The generic sequences allow for an 
additional cysteine at position 41 (Generic Sequence 7) 
or position 46 (Generic Sequence 8), providing an 

25 appropriate cysteine skeleton where inter- or 

intramolecular disulfide bonds can form, and containing 
certain critical amino acids which influence the 
tertiary structure of the proteins. 
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Generic Sequence 7 

Leu Xaa Xaa Xaa Phe 
1 5 
5 Xaa Xaa Xaa Gly Trp Xaa Xaa Xaa Xaa 

10 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

15 20 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
10 25 30 

Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
40 45 
15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
20 65 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

70 75 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

25 Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

85 90 
Xaa Cys Xaa Cys Xaa 
95 

wherein each Xaa is independently selected from a group 
30 of one or more specified amino acids defined as 

follows: "Res." means "residue" and Xaa at res .2 = 
(Tyr or Lys); Xaa at res. 3 « Val or lie); Xaa at res. 4 
= (Ser, Asp or Glu); Xaa at res. 6 - (Arg, Gin, Ser, Lys 
or Ala); Xaa at res. 7 = (Asp or Glu); Xaa at res. 8 = 
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(Leu, Val or lie); Xaa at res. 11 - (Gin, Leu, Asp, His, 
Asn or Ser); Xaa at res. 12 = (Asp, Arg, Asn or Glu); 
Xaa at res. 13 = (Trp or Ser); Xaa at res .14 = (lie or 
Val); Xaa at res .15 = (lie or Val); Xaa at res* 16 (Ala 
5 or Ser); Xaa at res. 18 - (Glu, Gin, Leu, Lys, Pro or 
Arg); Xaa at res. 19 = (Gly or Ser); Xaa at res. 20 = 
(Tyr or Phe); Xaa at res. 21 » (Ala, Ser, Asp, Met, His, 
Gin, Leu or Gly); Xaa at res. 2 3 « (Tyr, Asn or Phe); 
Xaa at res. 2 6 ~ (Glu, His, Tyr, Asp, Gin, Ala or Ser); 

10 Xaa at res. 28 = (Glu, Lys, Asp, Gin or Ala); Xaa at 
res .30 « (Ala, Ser, Pro, Gin, lie or Asn); Xaa at 
res. 31 » (Phe, Leu or Tyr); Xaa at res. 33 - (Leu, Val 
or Met); Xaa at res. 34 - (Asn, Asp, Ala, Thr or Pro); 
Xaa at res .35 = (Ser, Asp, Glu, Leu, Ala or Lys); Xaa 

15 at res .36 = (Tyr, Cys, His, Ser or lie); Xaa at res .37 
= (Met, Phe, Gly or Leu); Xaa at res. 38 = (Asn, Ser or 
Lys); Xaa at res. 39 = (Ala, Ser, Gly or Pro); Xaa at 
res. 40 = (Thr, Leu or Ser); Xaa at res. 4 4 = (lie, Val 
or Thr); Xaa at res. 45 = (Val, Leu, Met or lie); Xaa at 

20 res. 46 = (Gin or Arg); Xaa at res. 4 7 = (Thr, Ala or 
Ser); Xaa at res. 48 = (Leu or lie); Xaa at res. 49 - 
(Val or Met); Xaa at res. 50 = (His, Asn or Arg); Xaa at 
res. 51 = (Phe, Leu, Asn, Ser, Ala or Val); Xaa at 
res. 52 = (He, Met, Asn, Ala, Val, Gly or Leu); Xaa at 

25 res. 53 = (Asn, Lys, Ala, Glu, Gly or Phe); Xaa at 

res. 54 = (Pro, Ser or Val); Xaa at res. 55 = (Glu, Asp, 
Asn, Gly, Val, Pro or Lys); Xaa at res. 56 = (Thr, Ala, 
Val, Lys, Asp, Tyr, Ser, Gly, He or His); Xaa at 
res. 57 = (Val, Ala or He); Xaa at res .58 « (Pro or 

30 Asp); Xaa at res. 59 = (Lys, Leu or Glu); Xaa at 

res. 60 = (Pro, Val or Ala); Xaa at res. 63 « (Ala or 
Val); Xaa at res. 65 * (Thr, Ala or Glu); Xaa at res .66 
= (Gin, Lys, Arg or Glu); Xaa at res. 67 = (Leu, Met or 
Val); Xaa at res. 68 = (Asn, Ser, Asp or Gly); Xaa at 
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res. 69 = (Ala, Pro or Ser); Xaa at res .70 ■ (He, Thr, 
Val or Leu); Xaa at res. 71 = (Ser, Ala or Pro); Xaa at 
res. 72 = (Val r Leu, Met or lie); Xaa at res. 74 « (Tyr 
or Phe); Xaa at res. 75 = (Phe, Tyr, Leu or His); Xaa at 
5 res. 76 = (Asp, Asn or Leu); Xaa at res. 77 = (Asp, Glu, 
Asn, Arg or Ser); Xaa at res. 78 = (Ser, Gin, Asn, Tyr 
or Asp); Xaa at res. 79 « (Ser, Asn, Asp, Glu or Lys); 
Xaa at res .80 = (Asn, Thr or Lys); Xaa at res. 82 = 
(lie, Val or Asn); Xaa at res. 84 = (Lys or Arg); Xaa at 

10 res. 85 = (Lys, Asn, Gin, His, Arg or Val); Xaa at 

res .86 = (Tyr, Glu or His); Xaa at res. 87 - (Arg, Gin, 
Glu or Pro); Xaa at res. 88 = (Asn, Glu, Trp or Asp); 
Xaa at res. 90 = (Val, Thr, Ala or lie); Xaa at res. 92 = 
(Arg, Lys, Val, Asp, Gin or Glu); Xaa at res. 93 « (Ala, 

15 Gly, Glu or Ser); Xaa at res. 95 « (Gly or Ala) and Xaa 
at res. 97 * (His or Arg). 

As described above, Generic Sequence 8 (Seg. ID No. 
21) includes all of Generic Sequence 7 and in addition 
20 includes the following sequence at its N- terminus: 

Cys Xaa Xaa Xaa Xaa 
1 5 

25 Accordingly, beginning with residue 7, each "Xaa" 

in Generic Seq. 8 is a specified amino acid defined as 
for Generic Seq. 7, with the distinction that each 
residue number described for Generic Sequence 7 is 
shifted by five in Generic Seq. 8. Thus, "Xaa at res. 2 

30 =(Tyr or Lys)" in Gen. Seq. 7 refers to Xaa at res. 7 
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in Generic Seq. 8. In Generic Seq. 8, Xaa at res .2 = 
(Lys, Arg, Ala or Gin); Xaa at res. 3 = (Lys, Arg or 
Met); Xaa at res. 4 = (His, Arg or Gin); and Xaa at 
res. 5 = (Glu, Ser, His, Gly, Arg, Pro, Thr, or Tyr). 

5 

Accordingly, other useful sequences defining 
preferred C-terminal sequences are those sharing at 
least 70% amino acid sequence homology or "similarity", 
and preferably 80% homology or similarity with any of 

10 the sequences incorporated into Generic Seq. 7 and 8 
above. These are anticipated to include allelic, 
species, chimeric and other sequence variants, (e.g., 
including "muteins" or "mutant proteins"), whether 
naturally-occurring or biosynthetically produced, as 

15 well as novel members of this morphogenic family of 
proteins. As used herein, "amino acid sequence 
homology" is understood to mean amino acid sequence 
similarity, and homologous sequences share identical or 
similar amino acids, where similar amino acids are 

20 conserved amino acids as defined by Dayoff et al.. 
Atlas of Protein Sequence and Structure ; vol.5, 
Suppl.3, pp. 345-362 (M.O. Dayoff, ed., Nat'l BioMed. 
Research Fdn., Washington D.C. 1978.) Thus, a 
candidate sequence sharing 70% amino acid homology with 

25 a reference sequence requires that, following alignment 
of the candidate sequence with the reference sequence, 
70% of the amino acids in the candidate sequence are 
identical to the corresponding amino acid in the 
reference sequence, or constitute a conserved amino 

30 acid change thereto. "Amino acid sequence identity" is 
understood to require identical amino acids between two 
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aligned sequences. Thus, a candidate sequence sharing 
60% amino acid identity with a reference sequence 
requires that, following alignment of the candidate 
sequence with the reference sequence, 60% of the amino 
5 acids in the candidate sequence are identical to the 
corresponding amino acid in the reference sequence. 

As used herein, all homologies and identities 
calculated use OP-1 as the reference sequence. Also as 

10 used herein, sequences are aligned for homology and 

identity calculations using the method of Needleman et 
al. (1970) J.Mol. Biol. 48;443-453 and identities 
calculated by the Align program (DNAstar, Inc.) In all 
cases, internal gaps and amino acid insertions in the 

15 candidate sequence as aligned are ignored when making 
the homology/identity calculation. 

Also as used herein, "sequence variant" is 
understood to mean an amino acid sequence variant form 

20 of the morphogen protein, wherein the amino acid change 
or changes in the sequence do not alter significantly 
the morphogenic activity (e.g. r tissue regeneration 
activity) of the protein, and the variant molecule 
performs substantially the same function in 

25 substantially the same way as the naturally-occurring 
form of the molecule. Sequence variants may include 
single or multiple amino acid changes, and are intended 
to include chimeric sequences as described below. The 
variants may be naturally-occurring or may be 

30 biosynthetically induced by using standard recombinant 
DNA techniques or chemical protein synthesis 
methodologies • 
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The currently most preferred protein sequences 
useful in soluble morphogen complexes in this invention 
include those having greater than 60% identity, 
preferably greater than 65% identity, with the amino 
5 acid sequence defining the conserved six cysteine 
skeleton of hOFl (e.g., residues 335-431 of Seq. ID 
No* 5). These most preferred sequences include both 
allelic and species variants of the OP-1 and OP-2 
proteins, including the Drosophila 60A protein. 

10 Accordingly, in another preferred aspect of the 

invention, useful morphogens include active proteins 
comprising species of polypeptide chains having the 
generic amino acid sequence herein referred to as 
"OPX", which accommodates the homologies between the 

15 various identified species of OP1 and OP2 (Seq. ID 
No. 22). 

In still another preferred aspect of the invention, 
useful morphogens include active proteins comprising 

20 amino acid sequences encoded by nucleic acids that 
hydridize to DNA or RNA sequences encoding the 
conserved C-terminal cysteine domain of OP1 or OP2, 
e.g., defined by nucleotides 1036-1341 and nucleotides 
1390-1695 of Seq. ID Nos. 1 and 5, respectively, under 

25 stringent hybridization conditions. As used herein, 
stringent hybridization conditions are defined as 
hybridization in 40% formamide, 5 X SSPE, 5 X 
Denhardt's Solution, and 0.1% SDS at 37 °C overnight, 
and washing in 0.1 X SSPE, 0.1% SDS at 50 °C. 

30 Similarly, in another preferred aspect of the 
invention, useful pro region peptides include 
polypeptide chains comprising amino acid sequences 
encod d by nucleic acids that hybridize to DNA or RNA 
sequences ncoding at least the N-terminal 18 amino 
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acids of the pro region sequences for any of the 
sequences listed in Seq. ID Nos. 1-19 , under stringent 
hybridization conditions. Most preferably, the 
peptides are encoded by nucleic acids that hybridize to 
the DNA or RNA sequences encoding at least the 
N-terminal 18 amino acids of the pro region sequences 
for OP1 or OP2, e.g., nucleotides 136-192 and 
nucleotides 152-211 of Seq. ID Nos. 1 and 5, 
respectively. 



Useful N-terminal extension sequences are listed in 
Fig. 2 for use with the C-terminal domains described 
above. Also as described above, the full length N- 
terminal extensions, or truncated forms thereof, may be 

15 used in preferred dimeric species. The mature dimeric 
species may be produced from intact DNAs, or truncated 
forms thereof. It also is envisioned as an embodiment 
of the invention that chimeric morphogen sequences can 
be used. Thus, DNAs encoding chimeric morphogens may 

20 be constructed using part or all of the N-terminal 
extension from one morphogen and a C-terminal domain 
derived from one or more other morphogens. These 
chimeric proteins may be synthesized using standard 
recombinant DNA methodology and/or automated chemical 

25 nucleic acid synthesis methodology well described in 
the art. Other chimeric morphogens include soluble 
morphogen complexes where the pro domain is encoded 
from a DNA sequence corresponding to one or more 
morphogen pro sequences, and part or all of the mature 

30 domain is encoded by DNA derived from one or more 
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other, different morphogens. These soluble chimerics 
may be produced from a single synthetic DNA as 
described below, or, alternatively, may be formulated 
in vitro from isolated components also as described 
5 herein below. 

Finally, the morphogen pro domains and/or mature 
form N-terminal extensions themselves may be useful as 
tissue targeting sequences. As described above, the 

10 morphogen family members share significant sequence 
homology in their C-terminal active domains. By 
contrast, the sequences diverge significantly in the 
sequences which define the pro domain and the 
N-terminal 39 amino acids of the mature protein^ 

15 Accordingly, the pro domain and/or N-terminal extension 
sequence may be morphogen-specif ic. Accordingly, part 
or all of these morphogen-specif ic sequences may serve 
as tissue targeting sequences for the morphogens 
described herein. For example, the N-terminal 

20 extension and/or pro domains may interact specifically 
with one or more molecules at the target tissue to 
direct the morphogen associated with the pro domain to 
that tissue. Thus, for example, the morphogen-specif ic 
sequences of OP-1, BMP2 or BMP4, all of which proteins 

25 are found naturally associated with bone tissue (see, 
for example, US Pat. No. 5,011,691) may be particularly 
useful sequences when the morphogen complex is to be 
targeted to bone. Similarly, BMP6 (or Vgr-1) specific 
sequences may be used when targeting to lung tissue is 

30 desired. Alternatively, the morphogen-specif ic 
sequences of GDF-1 may be used to target soluble 
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morphogen complexes to nerve tissue, particularly brain 
tissue, where GDF-1 appears to be primarily expressed 
(see, for example, Lee, PNAS , 88:4250-4254 (1991), 
incorporated herein by reference). 

5 

II* Recombinant Production of Soluble 
Morphogen Complexes 

Soluble morphogen complexes can be produced from 
10 eukaryotic host cells, preferably mammalian cells, 

using standard recombinant expression techniques. An 
exemplary protocol currently preferred, is provided 
below, using a particular vector construct and Chinese 
hamster ovary (CHO) cell line. Those skilled in the 
15 art will appreciate that other expression systems are 
contemplated to be useful, including other vectors and 
other cell systems, and the invention is not intended 
to be limited to soluble morphogenic protein complexes 
produced only by the method detailed here inbe low. 
20 Similar results to those described herein have been 

observed using recombinant expression systems developed 
for COS and BSC cells. 

Morphogen DNA encoding the precursor sequence is 
25 subcloned into an insertion site of a suitable, 

commercially available pUC-type vector (e.g., pUC-19, 
ATCC #37254, Rockville, MD), along with a suitable 
promoter/enhancer sequences and 3' termination 
sequences. Useful DNA sequences include the published 
30 sequences encoding these proteins, and/or synthetic 
constructs. Currently preferred promoter/enhancer 
sequences are the CMV promoter (human cytomegalovirus 
major intermediate - early promoter) and the mouse 
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mammary tumor virus promoter (mMTV) boosted by the rous 
sarcoma virus LTR enhancer sequence (e.g., from 
Clontech, Inc., Palo Alto). Expression also may be 
further enhanced using trans activating enhancer 
5 sequences. The plasmid also contains DHFR as an 

amplifiable marker, under SV40 early promoter control 
(ATCC #37148). Transfection, cell culturing, gene 
amplification and protein expression conditions are 
standard conditions, well known in the art, such as are 

10 described, for example in Ausubel et al., ed., Current 
Protocols in Molecular Biology , John Wiley & Sons, NY 
(1989). Briefly, transfected cells are cultured in 
medium containing 0.1-0.5% dialyzed fetal calf serum 
(FCS) and stably transfected high expression cell lines 

15 are obtained by subcloning and evaluated by standard 
Western or Northern blot. Southern blots also are used 
to assess the state of integrated sequences and the 
extent of their copy number amplification. 

20 A currently preferred expression vector contains 

the DHFR gene, under SV40 early promoter control, as 
both a selection marker and as an inducible gene 
amplifier. The DNA sequence for DHFR is well 
characterized in the art, and is available 

25 commercially. For example, a suitable vector may be 
generated from pMAM-neo (Clontech, Inc., Palo Alto, CA) 
by replacing the neo gene (BamHl digest) with an Sphl- 
BamHI, or a PvuII-BamHI fragment from pSV5-DHFR (ATCC 
#37148), which contains the DHFR gene under SV40 early 

30 promoter control. A BamHl site can be engineered at 

the SphI or PvuII site using standard techniques (e.g., 
by linker insertion or site-directed mutagenesis) to 
allow insertion of the fragment into the vector 
backbone. The morphogen DNA can be ins rted into the 
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polylinker site downstream of the MMTV-LTR sequence 
(mouse mammary tumor virus LTR) . The CMV promoter 
sequence then may be inserted into the expression 
vector (e.g., from pCDM8, Invitrogen, Inc.) The SV40 
5 early promoter, which drives DHFR expression, 

preferably is modified in these vectors to reduce the 
level of DHFR mRNA produced. 

The currently preferred mammalian cell line is a 
10 CHO Chinese hamster ovary, cell line, and the preferred 
procedure for establishing a stable morphogen 
production cell line with high expression levels 
comprises trans fecting a stable CHO cell line, 
preferably CHO-DXB11, with the expression vector 
15 described above, isolating clones with high jnorphogen 
expression levels, and subjecting these clones to 
cycles of subcloning using a limited dilution method 
described below to obtain a population of high 
expression clones. Subcloning preferably is performed 
20 in the absence of MTX to identify stable high 

expression clones which do not require addition of MTX 
to the growth media for morphogen production. 

In the subcloning protocol cells are seeded on ten 
25 100mm petri dishes at a cell density of either 50 or 
100 cells per plate, with or preferably without MTX in 
the culture media. After 14 days of growth, clones are 
isolated using cloning cylinders and standard 
procedures, and cultured in 24-well plates. Clones 
30 then are screened for morphogen expression by Western 
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immunoblots using standard procedures, and morphogen 
expression levels compared to parental lines. Cell 
line stability of high expression subclones then is 
determined by monitoring morphogen expression levels 
5 over multiple cell passages (e.g., four or five 
passages) • 

Isolation of Soluble morphogen complex from 
conditioned media or body fluid 

10 

Horphogens are expressed from mammalian cells as 
soluble complexes. Typically, however the complex is 
disassociated during purification, generally by 
exposure to denaturants often added to the purification 

15 solutions, such as detergents, alcohols, organic 
solvents, chaotropic agents and compounds added to 
reduce the pH of the solution. Provided below is a 
currently preferred protocol for purifying the soluble 
proteins from conditioned media (or, optionally, a body 

20 fluid such as serum, cerebro-spinal or peritoneal 

fluid), under non-denaturing conditions. The method is 
rapid, reproducible and yields isolated soluble 
morphogen complexes in substantially pure form. 

25 Soluble morphogen complexes can be isolated from 

conditioned media using a simple, three step 
chromatographic protocol performed in the absence of 
denaturants. The protocol involves running the media 
(or body fluid) over an affinity column, followed by 

30 ion exchange and gel filtration chromatographies. The 
affinity column described below is a Zn-IMAC column. 
The present protocol has general applicability to the 
purification of a variety of morphogens, all of which 
are anticipated to be isolatable using only minor 
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modifications of the protocol described below. An 
alternative protocol also envisioned to have utility an 
immunoaf finity column, created using standard 
procedures and, for example, using antibody specific 
5 for a given morphogen pro domain (complexed, for 

example, to a protein A-conjugated Sepharose column.) 
Protocols for developing immunoaf finity columns are 
well described in the art, (see, for example, Guide to 
Protein Purification , M. Deutscher, ed., Academic 
10 Press, San Diego, 1990, particularly sections VII and 
XI. ) 

In this experiment OP-1 was expressed in CHO cells 
as described above. The CHO cell conditioned media 
containing 0.5% FBS was initially purified using 
Immobilized Metal-Ion Affinity Chromatography (IHAC). 
The soluble OP-1 complex from conditioned media binds 
very selectively to the Zn-IMAC resin and a high 
concentration of imidazole (50 mM imidazole, pH 8.0) is 
required for the effective elution of the bound 
complex. The Zn-IMAC step separates the soluble OP-1 
from the bulk of the contaminating serum proteins that 
elute in the flow through and 35 mM imidazole wash 
fractions. The Zn-IMAC purified soluble OP-1 is next 
applied to an S- Sepharose cation-exchange column 
equilibrated in 20 mM NaP0 4 (pH 7.0) with 50 mM NaCl. 
This S-Sepharose step serves to further purify and 
concentrate the soluble OP-1 complex in preparation for 
the following gel filtration step. The protein was 
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applied to a Sephacryl S-200HR column equilibrated in 
TBS. Using substantially the same protocol, soluble 
morphogens also may be isolated from one or more body 
fluids, including serum, cerebro-spinal fluid or 
5 peritoneal fluid. 

IMAC was performed using Chelating- Sepharose 
(Pharmacia) that had been charged with three column 
volumes of 0.2 M ZnS0 4 . The conditioned media was 

10 titrated to pH 7.0 and applied directly to the ZN-IMAC 
resin equilibrated in 20 mM HEPES (pH 7.0) with 500 mM 
NaCl. The Zn-IMAC resin was loaded with 80 mL of 
starting conditioned media per mL of resin. After 
loading the column was washed with equilibration buffer 

15 and most of the contaminating proteins were eluted with 
35 mM imidazole (pH 7.0) in equilibration buffer. The 
soluble OP-1 complex is then eluted with 50 mM 
imidazole (pH 8.0) in 20 mM HEPES and 500 mM NaCl. 

20 The 50 mM imidazole eluate containing the soluble 

OP-1 complex was diluted with nine volumes of 20 mM 
NaP0 4 (pH 7.0) and applied to an S-Sepharose 
(Pharmacia) column equilibrated in 20 mM NaP0 4 (pH 7.0) 
with 50 mM NaCl. The S-Sepharose resin was loaded with 

25 an equivalent of 800 mL of starting conditioned media 
per mL of resin. After loading the S-Sepharose column 
was washed with equilibration buffer and eluted with 
100 mM NaCl followed by 300 mM and 500 mM NaCl in 20 mM 
NaP0 4 (pH 7.0). The 300 mM NaCl pool was further 

30 purified using gel filtration chromatography. Fifty 
mis of the 300 mm NaCl eluate was applied to a 5.0 X 90 
cm Sephacryl S-200HR (Pharmacia) equilibrated in Tris 
buffer d saline (TBS), 50 mM Tris, 150 mM NaCl 
(pH 7.4). The column was eluted at a flow rate of 5 
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mL/minute collecting 10 mL fractions. The apparent 
molecular of the soluble OP-1 was determined by 
comparison to protein molecular weight standards 
(alcohol dehydrogenase (ADH, 150 kDa), bovine serum 
5 albumin (BSA, 68 kDa), carbonic anhydrase (CA, 30 kDa) 
and cytochrome C (cyt C, 12.5 kDa). (see Pig. 3) The 
purity of the S-200 column fractions was determined by 
separation on standard 15% polyacrylamide SDS gels 
stained with coomassie blue. The identity of the 
10 mature OP-1 and the pro-domain was determined by 

N- terminal sequence analysis after separation of the 
mature OP-1 from the pro-domain using standard reverse 
phase C18 HPLC. 

15 Figure 3 shows the absorbance profile at- 280 nm. 

The soluble OP-1 complex elutes with an apparent 
molecular weight of 110 kDa. This agrees well with the 
predicted composition of the soluble OP-1 complex with 
one mature OP-1 dimer (35-36 kDa) associated with two 

20 pro-domains (39 kDa each). Purity of the final complex 
can be verified by running the appropriate fraction in 
a reduced 15% polyacrylamide gel. 

The complex components can be verified by running 
25 the complex-containing fraction from the S-200 or S- 

200HR columns over a reverse phase C18 HPLC column and 
eluting in an acetonitrile gradient (in 0.1% TFA) , 
using standard procedures. The complex is dissociated 
by this step, and the pro domain and mature species 
30 elute as separate species. These separate species then 
can be subjected to N- terminal sequencing using 
standard procedures (see, for example, Guide to 
Protein Purification , M. Deutscher, ed., Academic 
Press, San Diego, 1990, particularly pp. 602-613), and 
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domain with the dimer. Useful denaturants include 4-6M 
urea or guanidine hydrochloride (GuHCl), in buffered 
solutions of pH 4-10, preferably pH 6-8. The soluble 
complex then is formed by controlled dialysis or 
5 dilution into a solution having a final denaturant 
concentration of less than 0.1-2M urea or GuHCl, 
preferably 1-2 M urea of GuHCl, which then preferably 
can be diluted into a physiological buffer. Protein 
pur if ication/renaturing procedures and considerations 

10 are well described in the art, and details for 

developing a suitable renaturing protocol readily can 
be determined by one having ordinary skill in the art. 
One useful text one the subject is Guide to Protein 
Purification , M. Deutscher, ed., Academic Press, San 

15 Diego, 1990, particularly section V. Complex formation 
also may be aided by addition of one or more chaperone 
proteins . 



20 



VI. Stability of Soluble Morphoqen Complexes 



The stability of the highly purified soluble 
morphogen complex in a physiological buffer, e.g., 
tris-buf fered saline (TBS) and phosphate-buffered 
saline (PBS), can be enhanced by any of a number of 

25 means. Currently preferred is by means of a pro region 
that comprises at least the first 18 amino acids of the 
pro sequence (e.g., residues 30-47 of Seq. ID NO. 1 for 
OP-1), and preferably is the full length pro region. 
Residues 30-47 show sequence homology to the N-terminal 

30 portion of other morphogens and are believed to have 
particular utility in enhancing complex stability for 
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all morphogens. Other useful means for enhancing the 
stability of soluble roorphogen complexes include three 
classes of additives. These additives include basic 
amino acids (e.g., L-arginine, lysine and betaine); 
5 nonionic detergents (e.g., Tween 80 or Nonldet P-120); 
and carrier proteins (e.g., serum albumin and casein). 
Useful concentrations of these additives include 1-100 
mM, preferably 10-70 mM, including 50 mM, basic amino 
acid;, 0.01-1.0%, preferably 0.05-0.2%, including 0.1% 
10 (v/v) nonionic detergent;, and 0.01-1.0%, preferably 
0.05-0.2%, including 0.1% (w/v) carrier protein. 

VII. Activity of Soluble Morphoqen Complex 

15 Association of the pro domain with the mature 

dimeric species does not interfere with the morphogenic 
activity of the protein in vivo as demonstrated by 
different activity assays. Specifically, soluble OP-1 
complex provided in a standard rat osteopenia model 

20 induces significant increase in bone growth and 
osteocalcin production (see Table II, below), in a 
manner analogous to the results obtained using mature 
morphogen. 

25 The assay is analogous to the osteoporosis model 

described in international application US92/07432 
(WO93/05751) , but uses aged female rats rather than 
ovariectomized animals. Briefly, young or aged female 
rats (Charles River Labs, 115-145, and 335-4 6 Og body 

30 weight, respectively) were dosed daily for 7 days by 
intravenous tail injection, with either 20 pg/Kg body 
weight soluble OP-1, or 100 pg/Kg body weight soluble 
OP-1. Control groups of young and aged female rats 
were dosed only with tris-buf fered salin (TBS). Water 
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and food were provided to all animals ad libitum. 
After 14 days, animals were sacrificed, and new bone 
growth measured by standard his tome trie procedures. 
Osteocalcin concentrations in serum also were measured. 
5 No detrimental effects of morphogen administration were 
detected as determined by changes in animal body or 
organ weight or by hematology profiles. 

TABLE II 

10 

No. Bone Area Osteocalcin 





Animals 


Animal Group 


(B.Ar/T.Ar ) 


(nq/ml) 




15 


4 


Control 


5.50 + 0.64 


11.89 + 


4.20 


20 


5 


Aged female, 
20/ig/Kg 
sol. OP-1 


7.68 + 0.63** 


22.24 + 


2.28** 


25 


5 


Aged female/ 
100/jg/Kg 
sol. OP-1 


9.82 + 3.31* 


20.87 + 


6.14*" 



*P < 0.05 
**P < 0,01 



30 Similar experiments performed using soluble OP-1 

complex in the osteoporosis model described in 
WO93/05751 using ovariectomized rats also show no 
detrimental effect using the complex form. 

35 Both mature and soluble morphogen also can induce 

CAM (cell adhesion molecule) expression, as 
demonstrated below. Briefly, induction of N-CAM 
isoforms (N-CAM-180, N-CAM- 140 and N-CAM- 120) can be 
monitored by reaction with the commercially available 

40 antibody mAb H28.123 (Sigma Co., St. Louis) and 
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available antibody mAb H28.123 (Sigma Co., St. Louis) 
and standard Western blot analysis (see, for example, 
Molecular Cloning, A Laboratory Manual , Sambrook et al. 
eds. Cold Spring Harbor Press, New York, 1989, 
5 particularly Section 18). Incubation of a growing 
culture of transformed cells of neuronal origin, 
NG108-15 eels (ATCC, Rockville, MD) with either mature 
morphogen dimers or soluble morphogen complexes (10-100 
ng/ml, preferably at least 40 ng/ml) induces a 

10 redifferentiation of these cells back to a morphology 
characteristic of untransformed neurons, including 
specific induction and/or enhanced expression of all 3 
N-CAM isoforms- In the experiment, cells were 
subcultured on poly-L- lysine coated 6-well plates and 

15 grown in chemically defined medium for 2 days before 

the experiment. Fresh aliguots of morphogen were added 
(2.5/jl) daily. 

VIII. Antibody Production 

20 

Provided below are standard protocols for 
polycolonal and monoclonal antibody production. For 
antibodies which recognize the soluble complex only, 
preferably the isolated pro region is used as the 
25 antigen; where antibodies specific to the mature 

protein are desired, the antigen preferably comprises 
at least the C-terminal domain or the intact mature 
sequence. 

30 Polyclonal antibody may be prepared as follows. 

Each rabbit is given a primary immunization of 100 
ug/500 pi of antigen, in 0.1% SDS mixed with 500 /il 
Complete Freund's Adjuvant. The antigen is injected 



WO 94/03600 



PCT/US93/07189 



- 42 - 

subcutaneous ly at multiple sites on the back and flanks 
of the animal. The rabbit is boosted after a month in 
the same manner using incomplete Freund's Adjuvant. 
Test bleeds are taken from the ear vein seven days 
5 later. Two additional boosts and test bleeds are 
performed at monthly intervals until antibody against 
the morphogen antigen is detected in the serum using an 
ELISA assay. Then, the rabbit is boosted monthly with 
100 fjg of antigen and bled (15 ml per bleed) at days 
10 seven and ten after boosting. 

Monoclonal antibody specific for a given morphogen 
may be prepared as follows. A mouse is given two 
injections of the morphogen antigen. The protein or 

15 protein fragment preferably is recombinantly produced. 
The first injection contains 100/jg of antigen in 
complete Freund's adjuvant and is given subcutaneously. 
The second injection contains 50 fjg of antigen in 
incomplete adjuvant and is given intraperitoneal ly. 

20 The mouse then receives a total of 230 fjg of OP-3 in 
four intraperitoneal injections at various times over 
an eight month period. One week prior to fusion, the 
mouse is boosted intraperitoneally with antigen (e.g., 
100 pg) and may be additionally boosted with a peptide 

25 fragment^ conjugated to bovine serum albumin with a 
suitable crosslinking agent. This boost can be 
repeated five days (IP)/ four days (IP)/ three days 
(IP) and one day (IV) prior to fusion. The mouse 
spleen cells then are fused to commercially available 

30 myeloma cells at a ratio of 1:1 using PEG 1500 
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(Boeringer Mannheim, Germany), and the fused cells 
plated and screened for mature or soluble morphogen- 
specific antibodies using the appropriate portion of 
the morphogen sequence as antigen. The cell fusion and 
5 monoclonal screening steps readily are performed 
according to standard procedures well described in 
standard texts widely available in the art. 

Using these standard procedures, anti-pro domain 
10 antisera was prepared from rabbits using the isolated 
pro domain from OP-1 as the antigen, and monoclonal 
antibody ("mAb") to the mature domain was produced in 
mice, using an E. coli- produced truncated form of OP-1 
as antigen* 

15 

Standard Western blot analysis performed under 
reducing conditions demonstrates that the anti-pro 
domain antisera ("anti-pro") is specific for the pro 
domain only, while the mAb to mature OP-1 ("anti-mature 

20 OP-1") is specific for the dimer subunits, that the two 
antibodies do not cross-react, and that the antibodies 
and can be used to distinguish between soluble and 
mature protein forms in a sample, e.g., of conditioned 
media or serum. A tabular representation of the 

25 Western blot results is in Table III below, where 

reactivity of mAb to mature OP-1 is indicated by "yy", 
and reactivity of the anti-pro antisera is indicated by 
"xx". 
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TABLE III 

Purified 

Purified Conditioned Isolated Dimer 
5 Antibody Sol OPl CHO Cell Media Pro Domain Subunits 

"anti-pro" xx xx xx 



io M anti- yy yy yy 

mature OP-1" 



15 IX. Immunoassays 

The ability to detect morphogens in solution and to 
distinguish between soluble and mature dimeric 
morphogen forms provides a valuable tool for" diagnostic 
20 assays, allowing one to monitor the level and type of 
morphogen free in the body, e.g., in serum and other 
body fluids, as well as to develop diagnostic and other 
tissue evaluating kits. 

25 For example, OP-1 is an intimate participant in 

normal bone growth and resorption. Thus, soluble OP-1 
is expected to be detected at higher concentrations in 
individuals experiencing high bone turnover, such as 
children, and at substantially lower levels in 

30 individuals with abnormally low rates of bone turnover, 
such as patients with osteoporosis, osteosarcoma, 
Paget 's disease and the like. Monitoring the level of 
OP-1, or other bone targeted morphogens such as BMP2 
and BMP4, in serum thus provides a means for evaluating 

35 the status of bone tissue in an individual, as well as 
a means for monitoring the efficacy of a treatment to 
regenerate damaged or lost bone tissue. Similarly, 



WO 94/03600 



PCT/US93/07189 



- 45 - 

monitoring -the level of endogenous GDF-1, can provide 
diagnostic information on the health of nerve tissue, 
particularly brain tissue* Moreover, following this 
disclosure one can distinguish between the level of 
5 soluble and mature forms in solution. 

A currently preferred detection means for 
evaluating the level of morphogen in a body fluid 
comprises an immunoassay utilizing an antibody or other 

10 suitable binding protein capable of reacting 

specifically with a morphogen and being detected as 
part, of a complex with the morphogen. Immunoassays may 
be performed using standard techniques known in the art 
and antibodies raised against a morphogen and specific 

15 for that morphogen. Antibodies which recognize a 

morphogen protein form of interest may be generated as 
described herein and these antibodies then used to 
monitor endogenous levels of protein in a body fluid, 
such as serum, whole blood or peritoneal fluid. To 

20 monitor endogenous concentrations of soluble morphogen, 
the antibody chosen preferably has binding specificity 
for the soluble form e.g., has specificity for the pro 
domain. Such antibodies may be generated by using the 
pro domain or a portion thereof as the antigen, 

25 essentially as described herein. A suitable pro domain 
for use as an antigen may be obtained by isolating the 
soluble complex and then separating the noncovalently 
associated pro domain from the mature domain using 
standard procedures, e.g., by passing the complex over 

30 an HPLC column, as described above or by separation by 
gel electrophoresis. Alternatively, the pro form of 
the protein in its monomeric form may be used as the 
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antigen and the candidate antibodies screened by 
Western blot or other standard immunoassay for those 
which recognize the pro domain of the soluble form of 
the protein of interest, but not the mature form, also 
5 as described above. 

Monomeric pro forms can be obtained from cell 
lysates of CHO produced cells, or from prokaryotic 
expression of a DNA encoding the pro form, in for 
10 example, B.coli . The pro form, which has an apparent 
molecular weight of about 50 kDa in mammalian cells, 
can then be isolated by HPLC and/or by gel 
electrophoresis, as described above. 

15 In order to detect and/or quantitate the amount of 

morphogenic protein present in a solution, an 
immunoassay may be performed to detect the morphogen 
using a polyclonal or monoclonal antibody specific for 
that protein. Here, soluble and mature forms of the 

20 morphogen also may be distinguished by using antibodies 
that discriminate between the two forms of the proteins 
as described above. Currently preferred assays include 
ELISAS and radioimmunassays, including standard 
competitor assays useful for guantitating the morphogen 

25 in a sample, where an unknown amount of sample 
morphogen is allowed to react with anti-morphogen 
antibody and this interaction is competed with a known 
amount of labeled antigen. The level of bound or free 
labeled antigen at equilibrium then is measured to 

30 quantitate the amount of unlabeled antigen in solution, 
the amount of sample antigen being proportional to the 
amount of free labeled antigen. Exemplary protocols 
for these assays are provided below. However, as will 
be appreciated by those skilled in the art, variations 
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of these protocols, as well as other immunoassays, are 
well known in the literature and within the skill of 
the art* For example, in the ELISA protocol provided 
below, soluble OP-1 is identified in a sample using 
5 biotinylated anti-pro antiserum. Biotinylated 

antibodies can be visualized in a colormetric assay or 
in a chemiluminescent assay, as described below. 
Alternatively, the antibody can be radio-labeled with a 
suitable molecule, such as 12 5 1. Still another 

10 protocol that may be used is a solid phase immunoassay, 
preferably using an affinity column with anti-morphogen 
antibody completed to the matrix surface and over which 
a serum sample may be passed. A detailed description 
of useful immunoassays, including protocols and general 

15 considerations is provided in, for example, Molecular 
Cloning ; A Laboratory Manual , Sambrook et al., eds. 
Cold Spring Harbor Press, New York, 1989 f particularly 
Section 18. 

20 For serum assays, the serum preferably first is 

partially purified to remove some of the excess, 
contaminating serum proteins, such as serum albumin. 
Preferably the serum is extracted by precipitation in 
ammonium sulfate (e.g., 45%) such that the complex is 

25 precipitated. Further purification can be achieved 
using purification strategies that take advantage of 
the differential solubility of soluble morphogen 
complex or mature morphogens relative to that of the 
other proteins present in serum. Further purification 

30 also can be achieved by chromatographic techniques well 
known in the art. 



WO 94/03600 



PCT/US93/07189 



- 48 - 

Soluble OP-1 may be detected using a polyclonal 
antibody specific for the OP-1 pro domain in an ELISA, 
as follows. 1 pg/100 pi of affinity-purified 
polyclonal rabbit IgG specific for OP-l-pro is added to 
5 each well of a 96-well plate and incubated at 37 °C for 
an hour. The wells are washed four times with 0.167M 
sodium borate buffer with 0.15 M NaCl (BSB), pH 8.2, 
containing 0.1% Tween 20. To minimize non-specific 
binding, the wells are blocked by filling completely 

10 with 1% bovine serum albumin (BSA) in BSB and 

incubating for 1 hour at 37 °C. The wells are then 
washed four times with BSB containing 0.1% Tween 20. A 
100 fjl aliquot of an appropriate dilution of each of 
the test samples of cell culture supernatant or serum 

15 sample is added to each well in triplicate and 

incubated at 37 °C for 30 rain. After incubation, 100 /jl 
biotinylated rabbit anti-pro serum (stock solution is 
about 1 mg/ml and diluted 1:400 in BSB containing 1% 
BSA before use) is added to each well and incubated at 

20 37°C for 30 min. The wells are then washed four times 
with BSB containing 0.1% Tween 20. 100 pi 
strepavidin-alkaline (Southern Biotechnology 
Associates, Inc. Birmingham, Alabama, diluted 1:2000 in 
BSB containing 0.1% Tween 20 before use) is added to 

25 each well and incubated at 37°C for 30 min. The plates 
are washed four times with 0.5M Tris buffered Saline 
(TBS), pH 7.2. 50pl substrate (ELISA Amplification 
System Kit, Life Technologies, Inc., Bethesda, MD) is 
added to each well incubated at room temperature for 15 

30 min. Then, 50 pi amplifier (from the same 

amplification system kit) is added and incubated for 
another 15 min at room temperature. The reaction is 
stopped by the addition of 50 /j1 0.3 M sulphuric acid. 
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The OD at 490 nm of the solution in each well is 
recorded. To quantitate the level of soluble OP-1 in 
the sample, a standard curve is performed in parallel 
with the test samples. In the standard curve, known 
5 increasing amounts of purified OP-1 -pro is added. 
Alternatively, using, for example, Lumi-phos 530 
(Analytical Luminescence Laboratories) as the substrate 
and detection at 300-650 nm in a standard luminometer, 
complexes can be detected by chemiluminescence, which 
10 typically provides a more sensitive assay than 
detection by means of a visible color change. 

Morphogen (soluble or mature form) may be detected 
in a standard plated-based radioimmunoassay as follows. 

15 Empirically determined limiting levels of 

anti-morphogen antibody (e.g., anti-OP-1, typically 
50-80 ng/well) are bound to wells of a PVC plate e.g., 
in 50 pi PBS phosphate buffered saline. After 
sufficient incubation to allow binding at room 

20 temperature, typically one hour, the plate is washed in 
a PBS/Tween 20 solution, ("washing buffer" ), and 200 pi 
of block (3% BSA, O.lp lysine in lxBSB) is added to 
each well and allowed to incubate for 1 hour, after 
which the wells are washed again in washing buffer. 40 

25 pi of a sample composed of serially diluted plasma 

(preferably partially purified as described above) or 
morphogen standard (e.g., OP-1) is added to wells in 
triplicate. Samples preferably are diluted in PTTH 
(15 mM KH 2 P0 4 , 8 mM Na 2 P0 4 , 27 mM KC1, 137 mM NaCl, 

30 0.05% Tween 20, 1 mg/ml HSA, 0.05% NaN^, pH 7.2). 
10 pi of labelled competitor antigen, preferably 
100,000-500,000 cpm/sample is added (e.g., 125 I OP-1, 
radiolabell d using standard procedures), and plates 
are incubated overnight at 4°C. Plates then are washed 
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in washing buffer , and allowed to dry. Wells are cut 
apart and bound labelled OP-1 counted in a standard 
gamma counter. The quantities of bound labelled 
antigen (e.g., 125 I OP-1) measured in the presence and 
5 absence of sample then are compared, the difference 
being proportional to the amount of sample antigen 
(morphogen) present in the sample fluid* 

As a corollary assay method, immunoassays may be 
10 developed to detect endogenous anti-morphogen 

antibodies, and to distinguish between such antibodies 
to soluble or mature forms. Endogenous anti-morphogen 
antibodies have been detected in serum, and their level 
is known to increase, for example, upon implanting of 
15 an osteogenic device in a mammal. Without being 

limited to a particular theory, these antibodies may 
play a role in modulating morphogen activity by 
modulating the level of available protein in serum. 
Assays that monitor the level of endogenous antibodies 
20 in blood or their body fluids thus can be used in 

diagnostic assays to evaluate the status of a tissue, 
as well as to provide a means for monitoring the 
efficacy of a therapy for tissue regeneration. 

25 The currently preferred means for detecting 

endogenous anti-morphogen antibodies is by means of a 
standard Western blot. See, for example, Molecular 
Cloning; A Laboratory Manual Sambrook et al . , eds • , 
Cold Spring Harbor Press, New York, 1989, particularly 

30 pages 18.60-18.75, incorporated herein by reference, 
for a detailed description of these assays. Purified 
mature or soluble morphogen is electrophoresed on an 
SDS polyacrylamide gel under oxidized or reduced 
conditions designed to separate the proteins in 
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solution, and the proteins then transferred to a 
polyvinyl idene difluoride micropenis membrane 
(0.45 fim pore sizes) using standard buffers and 
procedures. The filter then is incubated with the 
5 serum being tested (at various dilutions)* Antibodies 
bound to either the pro domain or the mature morphogen 
domain are detected by means of an anti-human antibody 
protein , e.g./ goat anti-human Ig. Titers of the 
antimorphogen antibodies can be determined by further 
10 dilution of the serum until no signal is detected. 

X. Formulations and Methods for Administering Soluble 
Morphogens as Therapeutic Agents 

15 The soluble morphogens of this invention are 

particularly useful as therapeutic agents to regenerate 
diseased or damaged tissue in a mammal, particularly a 
human. 

20 The soluble morphogen complexes may be used to 

particular advantage in regeneration of damaged or 
diseased lung, heart, liver, kidney, nerve or pancreas 
tissue, as well as in the transplantation and/or 
grafting of these tissues and bone marrow, skin, 

25 gastrointestinal mucosa, and other living tissues. 

The soluble morphogen complexes described herein 
may be provided to an individual by any suitable means, 
preferably directly or systemically, e.g., parenterally 
30 or orally. Where the morphogen is to be provided 

directly (e.g., locally, as by injection, to a desired 
tissue site), or parenterally, such as by intravenous, 
subcutaneous , intramuscular , intraorbital , ophthalmic , 
intraventricular , intracranial , intracapsular , 
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intraspinal, intracistemal, intraperitoneal, buccal, 
rectal, vaginal, intranasal or by aerosol 
administration, the soluble morphogen complex 
preferably comprises part of an aqueous solution. The 
5 solution is physiologically acceptable so that in 
addition to delivery of the desired morphogen to the 
patient, the solution does not otherwise adversely 
affect the patient's electrolyte and volume balance. 
The aqueous medium for the soluble morphogen thus may 
10 comprise normal physiologic saline (0.9% NaCl, 0.15M), 
pH 7-7.4. 

Soluble morphogens of this invention are readily 
purified from cultured cell media into a physiological 

15 buffer, as described above. In addition, and as 

described above, if desired, the soluble complexes may 
be formulated with one or more additional additives, 
including basic amino acids (e.g., L-arginine, lysine, 
betaine); non- ionic detergents (e.g. Tween-80 or 

20 NonIdet-120) and carrier proteins (e.g., serum albumin 
and casein). 

Useful solutions for oral or parenteral 
administration may be prepared by any of the methods 

25 well known in the pharmaceutical art, described, for 
example, in Remington's Pharmaceutical Sciences , 
(Gennaro, A., ed.), Mack Pub., 1990. Formulations may 
include, for example, polyalkylene glycols such as 
polyethylene glycol, oils of vegetable origin, 

30 hydrogenated naphthalenes, and the like. Formulations 
for direct administration, in particular, may include 
glycerol and other compositions of high viscosity. 
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Biocompatible, preferably bioresorbable polymers, 
including, for example, hyaluronic acid, collagen, 
tricalcium phosphate, polybutyrate, polylactide, 
polyglycolide and lactide/glycolide copolymers, may be 
useful excipients to control the release of the soluble 
morphogen in vivo . 

Other potentially useful parenteral delivery 
systems for these morphogens include ethylene-vinyl 
acetate copolymer particles, osmotic pumps, implantable 
infusion systems, and liposomes. Formulations for 
inhalation administration may contain as excipients, 
for example, lactose, or may be aqueous solutions 
containing, for example, polyoxyethylene-9-lauryl 
ether, glycocholate and deoxycholate, or oily solutions 
for administration in the form of nasal drops, or as a 
gel to be applied intranasally. 

The soluble morphogens described herein also may be 
administered orally. Oral administration of proteins 
as therapeutics generally is not practiced as most 
proteins readily are degraded by digestive enzymes and 
acids in the mammalian digestive system before they can 
be absorbed into the bloodstream. However, the mature 
domains of the morphogens described herein typically 
are acid-stable and protease-resistant (see, for 
example, U.S. Pat. No. 4,968,590.) In addition, at 
least one morphogen, OP-1, has been identified, in 
mammary gland extract, colostrum and milk, as well as 
saliva. Moreover, the OP-1 purified from mammary gland 
extract is morphogenically active. For example, this 
protein induces endochondral bone formation in mammals 
when implanted subcutaneously in association with a 
suitable matrix material, using a standard in vivo bone 
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assay, such as is disclosed in U.S. Pat. No. 4,968,590. 
In addition, endogenous morphogen also is detected in 
human serum (see above). Finally, comparative 
experiments with soluble and mature morphogens in a 
5 number of experiments defining morphogenic activity 
indicate that the non-covalent association of the pro 
domain with the dimeric species does not interfere with 
morphogenic activity. These findings indicate that 
oral and parenteral administration are viable means for 
10 administering morphogens to an individual, and that 
soluble morphogens have utility in systemic 
administration protocols. 

The soluble complexes provided herein also may be 

15 associated with molecules capable of targeting the 
morphogen to a desired tissue. For example, 
tetracycline and diphosphonates (bisphosphonates) are 
known to bind to bone mineral, particularly at zones of 
bone remodeling, when they are provided systemically in 

20 a mammal. Accordingly, these molecules may be included 
as useful agents for targeting soluble morphogens to 
bone tissue. Alternatively, an antibody or other 
binding protein that interacts specifically with a 
surface molecule on the desired target tissue cells 

25 also may be used. Such targeting molecules further may 
be covalently associated to the morphogen complex, 
e.g., by chemical crosslinking, or by using standard 
genetic engineering means to create, for example, an 
acid labile bond such as an Asp-Pro linkage. Useful 

30 targeting molecules may be designed, for example, using 
the single chain binding site technology disclosed, for 
example, in U.S. Pat. No. 5,091,513. 
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Finally, the soluble morphogen complexes provided 
herein may be administered alone or in combination with 
other molecules known to have a beneficial effect on 
tissue morphogenesis, including molecules capable of 
5 tissue repair and regeneration and/or inhibiting 
inflammation. Examples of useful cof actors for 
stimulating bone tissue growth in osteoporotic 
individuals, for example, include but are not limited 
to, vitamin D 3 , calcitonin, prostaglandins, parathyroid 

10 hormone, dexamethasone, estrogen and IGF-I or IGF-II. 
Useful cofactors for nerve tissue repair and 
regeneration may include nerve growth factors. Other 
useful cofactors include symptom-alleviating cofactors, 
including antiseptics, antibiotics, antiviral and 

15 antifungal agents and analgesics and anesthetics. 

The compounds provided herein can be formulated 
into pharmaceutical compositions by admixture with 
pharmaceutically acceptable nontoxic excipients and 

20 carriers. As noted above, such compositions may be 

prepared for parenteral administration, particularly in 
the form of liquid solutions or suspensions; for oral 
administration, particularly in the form of tablets or 
capsules; or intranasally, particularly in the form of 

25 powders, nasal drops or aerosols. Where adhesion to a 
tissue surface is desired the composition may include 
the morphogen dispersed in a fibrinogen-thrombin 
composition or other bioadhesive such as is disclosed, 
for example in PCT OS91/09275, the disclosure of which 

30 is incorporated herein by reference. The composition 
then may be painted, sprayed or otherwise applied to 
the desired tissue surface. 
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The compositions can be formulated for parenteral 
or oral administration to humans or other mammals in 
therapeutically effective amounts, e.g., amounts which 
provide appropriate concentrations of the morphogen to 
5 target tissue for a time sufficient to induce 

morphogenesis, including particular steps thereof, as 
described above « 

Where the soluble morphogen complex is to be used 
10 as part of a transplant procedure, the morphogen may be 
provided to the living tissue or organ to be 
transplanted prior to removal of the tissue or organ 
from the donor. The morphogen may be provided to the 
donor host directly, as by injection of a formulation 
15 comprising the soluble complex into the tissue, or 

indirectly, e.g., by oral or parenteral administration, 
using any of the means described above. 

Alternatively or, in addition, once removed from 
20 the donor, the organ or living tissue may be placed in 
a preservation solution containing the morphogen. In 
addition, the recipient also preferably is provided 
with the morphogen just prior to, or concommitant with, 
transplantation. In all cases, the soluble complex may 
25 be administered directly to the tissue at risk, as by 
injection to the tissue, or it may be provided 
systemically, either by oral or parenteral 
administration, using any of the methods and 
formulations described herein and/or known in the art. 



Where the morphogen comprises part of a tissue or 
organ preservation solution, any commercially available 
preservation solution may be used to advantage. A 
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useful preservation solution is described in in 
PCT/DS92/07358 (WO93/04692 ) , incorporated herein by 
reference. 

5 As will be appreciated by those skilled in the art, 

the concentration of the compounds described in a 
therapeutic composition will vary depending upon a 
number of factors, including the dosage of the drug to 
be administered, the chemical characteristics (e.g., 

10 hydrophobicity) of the compounds employed, and the 

route of administration. The preferred dosage of drug 
to be administered also is likely to depend on such 
variables as the type and extent of tissue loss or 
defect, the overall health status of the particular 

15 patient, the relative biological efficacy of the 

compound selected, the formulation of the compound, the 
presence and types of excipients in the formulation, 
and the route of administration. In general terms, the 
compounds of this invention may be provided in an 

20 aqueous physiological buffer solution containing about 
0.001 to 10% w/v compound for parenteral 
administration. Typical dose ranges are from about 10 
ng/kg to about 1 g/kg of body weight per day; a 
preferred dose range is from about 0.1 pg/kg to 

25 100 mg/kg of body weight. No obvious morphogen-induced 
pathological lesions are induced when mature morphogen 
(e.g., OP-1, 20 pg) is administered daily to normal 
growing rats for 21 consecutive days. Moreover, 10 pg 
systemic injections of morphogen (e.g., OP-1) inject ad 

30 daily for 10 days into normal newborn mice does not 
produce any gross abnormalities. 
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Where morphogens are administered systemically, in 
the methods of the present invention, preferably a 
large volume loading dose is used at the start of the 
treatment. The treatment then is continued with a 
5 maintenance dose. Further administration then can be 
determined by monitoring at intervals the levels of the 
morphogen in the blood. 

Other Embodiments 

10 

The invention may be embodied in other specific 
forms without departing from the spirit or essential 
characteristics thereof. The present embodiments are 
therefore to be considered in all respects as 

15 illustrative and not restrictive, the scope of the 

invention being indicated by the appended claims rather 
than by the foregoing description, and all changes 
which come within the meaning and range of equivalency 
of the claims are therefore intended to be embraced 

20 therein. 
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What is claimed is: 

!• Dimeric protein comprising a pair of protein 
subunits associated to defined a dimeric structure 
having morphogenic activity , 

each of said subunits comprising at least a 
100 amino acid sequence having a pattern of cysteine 
residues characteristic of the morphogen family, 

at least one of said subunits comprising a 
mature form of a subunit of a member of the morphogen 
family , or an allelic, species, or sequence variant 
thereof, noncovalently complexed with 

a peptide comprising a pro region of a member 
of the morphogen family, or an allelic, species, or 
sequence variant thereof to form a complex which is 
more soluble in aqueous solvents than the uncomplexed 
pair of subunits. 

2. The protein of claim 1 wherein both said subunits 
comprise a mature form of a subunit of a member of the 
morphogen family or an allelic, species, or sequence 
variant thereof, each said subunit being noncovalently 
complexed with a said peptide. 

3. The protein of claim 1 wherein each said subunit 
is the mature form of human OP-1, or a species or 
allelic variant thereof. 

4. The protein of claim 1, 2, or 3 wherein the 
peptide comprises the pro region of human OP-1, or a 
species, allelic or sequence variant thereof. 
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5. The protein of claim 1 wherein said peptide 
comprises at least the first 18 amino acids of an amino 
acid sequence defining said pro region. 

6. The protein of claim 1 wherein said peptide 
comprises at least the first 18 amino acids of an amino 
acid sequence defining said pro region in Seq. ID Nos. 
1*16 or a sequence variant thereof. 

7. The protein of claim 1 or 6 wherein said peptide 
comprises the full length form of said pro region. 

8. The protein of claim 1 wherein said pro region 
peptide comprises an amino acid sequence selected from 
sequences defined by residues 30-48, 30-292 ^and 48-292 
of Seq. ID No. 1. 

9. The protein of claim 1 wherein said pro region 
peptide comprises an amino acid sequence encoded by a 
nucleic acid that hybridizes under stringent conditions 
with a DNA encoding the N- terminal 18 amino acids of 
the pro region sequences for Seq. ID Nos. 1-19. 

10. The protein of claims 1 or 9 wherein said pro 
region peptide comprises a nucleic acid that hybridizes 
under stringent conditions with a DNA defined by 
nucleotides of 136-192 of Seq. ID No. 1 or nucleotides 
157-211 of Seq. ID No. 5. 

11. The protein of claim 1 wherein said subunit 
sequence variant comprises a chimeric morphogen amino 
acid sequence. 
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12. The protein of claim 1 wherein said peptide 
comprises a chimeric pro region amino acid sequence. 

13. The protein of claim 1 wherein said subunit 
comprises a sequence defined by Generic Sequence 7 or 
Generic Sequence 8. 

14. The protein of claim 1 wherein said subunit 
comprises a sequence having 60% amino acid identity 
with the sequence defined by residues 335-431 of Seq. 
ID No.l. 

15. The protein of claim 1 wherein said subunit 
comprises the mature form of a subunit defined by any 
of the sequences of Seq. ID No. 5-19. 

16. The protein of claim 1 wherein said subunit 
comprises an amino acid sequence encoded by a nucleic 
acid that hybridizes with a DNA defined by nucleotides 
1036-1341 of Seq. ID No. 1 or nucleotides 1390-1695 of 
Seq. ID No. 5. 

17. The protein of claim 1 further comprising an 
molecule capable of enhancing the stability of said 
complex • 

IB. A therapeutic composition comprising the protein 
of any of claims 1, 2, 5-9 or 11-17. 

19. A therapeutic composition comprising the protein 
of claim 1 wherein each said subunit is the mature form 
of human OP-1, or a species or allelic variant thereof. 
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20. A therapeutic composition comprising the protein 
of claim 1, wherein said peptide comprises part or all 
of the pro region of human OP-1, or a species or 
allelic variant thereof. 

21* The therapeutic composition of claim 18 comprising 
the protein of claim 1 wherein said subunit comprises 
the mature form of a subunit defined by any of the 
sequences of Seg. ID Nos. 5-19. 

22. A therapeutic composition comprising the protein 
of claims 3, 4 or 10. 

23. The therapeutic composition of claims 18 or 22 
further comprising a cof actor. 

24. The therapeutic composition of claim 23 wherein 
said cof actor is a symptom-alleviating cof actor. 

25. A kit for diagnosing a tissue disorder or 
evaluating the efficacy of a therapy to regenerate lost 
or damaged tissue in a mammal , the kit comprising: 

a) means for capturing a cell or fluid sample 
from said mammal, 

b) a binding protein capable of interacting 
specifically with a soluble morphogen complex in said 
sample, and 

c) means for detecting the binding protein bound 

to said soluble morphogen complex. <■ 

26. The kit of claim 25 wherein said binding protein 
is an antibody. 
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27. A method for evaluating the status of a tissue, 
the method comprising the step of comparing the 
quantity of morphogen in a body fluid sample with the 
quantity of morphogen in a control sample. 

28. A method for evaluating the efficacy of a therapy 
to regenerate lost or damaged tissue in a mammal, the 
method comprising the step of comparing the quantity of 
morphogen in a body fluid sample with the quantity of 
morphogen in a control sample. 

29. A method for diagnosing a tissue disorder in a 
mammal, the method comprising the step of comparing the 
quantity of morphogen in a body fluid sample with the 
quantity of morphogen in a control sample. 

30. The invention of claim 25, 26, 27 or 28 wherein 
said morphogen is a dimeric protein comprising a pair 
of protein subunits associated to defined a dimeric 
structure having morphogenic activity, 

each of said subunits comprising at least a 
100 amino acid sequence having a pattern of cysteine 
residues characteristic of the morphogen family, 

at least one of said subunits comprising a 
mature form of a subunit of a member of the morphogen 
family, or an allelic, species, or sequence variant 
thereof, noncovalently complexed with 

a peptide comprising a pro region of a member 
of the morphogen family, or an allelic, species, or 
sequence variant thereof to form a complex which is 
more soluble in aqueous solvents than the uncomplexed 
pair of subunits. 
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31. The invention of claims 25, 26 , 27 or 28 wherein 
said quantity of morphogen is detected by an 
immunoassay. 

32. The invention of claims 25 , 26, 27 or 28 wherein 
said quantity of morphogen is detected by an antibody 
capable of distinguishing soluble morphogen in a sample 
fluid. 

33. The invention of claims 25, 26, 27 or 28 wherein 
said body fluid sample comprises serum. 

34. The invention of claims 25 or 28 wherein said 
tissue disorder is a bone tissue disorder. 

35. The invention of claim 34 wherein said bone tissue 
disorder is selected from the group consisting of 
osteosarcoma, osteoporosis, and Paget 's disease. 

36. A method of evaluating the status of a tissue, the 
method comprising the step of detecting the presence of 
anti-morphogen antibody in a tissue or body fluid 
sample. 

37. A method for evaluating the efficacy of a therapy 
to regenerate lost or damaged tissue, the method 
comprising the step of detecting the presence of anti- 
morphogen antibody in a tissue or body fluid sample. 

38. A method for diagnosing a tissue disorder, the 
method comprising the step of detecting the presence of 
anti-morphogen antibody in a tissue or body fluid 
sample. 
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39. A kit for diagnosing a tissue disorder or 
evaluating the efficacy of a therapy to regenerate lost 
or damaged tissue in a mammal, the kit comprising: 

a) means for capturing a cell or fluid sample 
from said mammal; 

b) a binding protein capable of interacting 
specifically with an endogenous anti-morphogen antibody 
in said sample; and 

c) means for detecting said binding protein-bound 
to said endogenous anti-morphogen antibody. 
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